Approximating the model of a water distribution network as a Markov decision process

نویسندگان

چکیده

In this paper, our objective is to convert the model of a water distribution network described via Stochastic differential equations (SDE) into Markov decision process (MDP). The motivation behind work that while MDP's represents underlying dynamics for dynamic programming and reinforcement learning, actual best equations, therefore, we would like SDE MDP. We have applied Kushner's chain approximation (MCA) method verified it using novel modified Monte Carlo which can be considered as an alternative well-known MCA. Both methods approximate value function simulation studies show obtained functions from both converge almost same value.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

investigating the feasibility of a proposed model for geometric design of deployable arch structures

deployable scissor type structures are composed of the so-called scissor-like elements (sles), which are connected to each other at an intermediate point through a pivotal connection and allow them to be folded into a compact bundle for storage or transport. several sles are connected to each other in order to form units with regular polygonal plan views. the sides and radii of the polygons are...

translating allusive devices:a survey of a portrait of the artist as a young man by james joyce

تلمیح یکی از عناصری است که تقریباً در همه ی متون ادبی یافت و باعث ایجاد شکاف های فرهنگی می شود. در این تحقیق به عنوان شکلی از بینامتنیت در ترجمه مورد توجه قرار می گیرد. تلاش شده است تا راهکارهای مترجمان برای ترجمه چهار نوع اسامی خاص و عبارات کلیدی تلمیحی (مذهبی، سیاسی، تاریخی و اسطوره ای) موجود در رمانِ چهره مرد هنرمند در جوانی به فارسی بررسی شود. این تحقیق مقایسه ای بر اساس راهکارهای ترجمه تلمیح...

15 صفحه اول

the impact of portfolio assessment on iranian efl students essay writing: a process-oriented approach

this study was conducted to investigate the impact of portfolio assessment as a process-oriented assessment mechanism on iranian efl students’ english writing and its subskills of focus, elaboration, organization, conventions, and vocabulary. out of ninety juniors majoring in english literature and translation at the university of isfahan, sixty one of them who were at the same level of writing...

15 صفحه اول

a study on insurer solvency by panel data model: the case of iranian insurance market

the aim of this thesis is an approach for assessing insurer’s solvency for iranian insurance companies. we use of economic data with both time series and cross-sectional variation, thus by using the panel data model will survey the insurer solvency.

The Markov Decision Process Extraction Network

This paper presents the Markov decision process extraction network, which is a data-efficient, automatic state estimation approach for discrete-time reinforcement learning (RL) based on recurrent neural networks. The architecture is designed to model the minimal relevant dynamics of an environment, capable of condensing large sets of continuous observables to a compact state representation and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IFAC-PapersOnLine

سال: 2022

ISSN: ['2405-8963', '2405-8971']

DOI: https://doi.org/10.1016/j.ifacol.2022.09.107